CS 229 Final Project: Single Image Depth Estimation From Predicted Semantic Labels

نویسندگان

  • Beyang Liu
  • Stephen Gould
  • Daphne Koller
چکیده

Recovering the 3D structure of a scene from a single image is a fundamental problem in computer vision that has applications in robotics, surveillance, and general scene understanding. However, estimating structure from raw image features is notoriously difficult since local appearance is insufficient to resolve depth ambiguities (e.g., sky and water regions in an image can have similar appearance but dramatically different geometric placement within a scene). Intuitively, semantic understanding of a scene plays an important role in our own perception of scale and 3D structure. Our goal is to estimate the depth of each pixel in an image. We employ a two phase approach: In the first phase, we use a learned multi-class image labeling MRF to estimate the semantic class for each pixel in the image. We currently label pixels as one of: sky, tree, road, grass, water, building, mountain, and foreground object. In the second phase, we use the predicted semantic class labels to inform our depth reconstruction model. Here, we first learn a separate depth estimator for each semantic class. We incorporate these predictions in a Markov random field (MRF) that includes semantic-aware reconstruction priors such as smoothness and orientation. Motivated by the work of Saxena et. al., [6], we explore both pixelbased and superpixel-based variants of our model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semiautomatic Image Retrieval Using the High Level Semantic Labels

Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges related to each of these approaches, guide the researchers to use combining approaches and semi-automatic retrieval using the user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provided two kind of qu...

متن کامل

Deep Learning for Depth Learning CS 229 Course Project, 2013 Fall

Extracting 3D depth information from images is a classic problem of computer vision. Traditionally image depth could be extracted by techniques such as stereo camera or images from multiple views. In this project, we are trying to recognize the depth information by using a single still image from single camera, which has great potential applications in vision and recognition. To learn the compl...

متن کامل

Lifting GIS Maps into Strong Geometric Context

Contextual information can have a substantial impact on the performance of visual tasks such as semantic segmentation, object detection, and geometric estimation. Data stored in Geographic Information Systems (GIS) offers a rich source of contextual information that has been largely untapped by computer vision. We propose to leverage such information for scene understanding by combining GIS res...

متن کامل

Monocular Depth Estimation by Learning from Heterogeneous Datasets

Depth estimation provides essential information to perform autonomous driving and driver assistance. Especially, Monocular Depth Estimation is interesting from a practical point of view, since using a single camera is cheaper than many other options and avoids the need for continuous calibration strategies as required by stereo-vision approaches. State-of-theart methods for Monocular Depth Esti...

متن کامل

Strategic Decision-Making Learning from Label Distributions: An Approach for Facial Age Estimation

Nowadays, label distribution learning is among the state-of-the-art methodologies in facial age estimation. It takes the age of each facial image instance as a label distribution with a series of age labels rather than the single chronological age label that is commonly used. However, this methodology is deficient in its simple decision-making criterion: the final predicted age is only selected...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009